NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

PACE: A Framework for Learning and Control in Linear Incomplete-Information Differential Games

Soltanian, Seyed Yousef; Zhang, Wenlong (May 2025, Proceedings of Machine Learning Research)

In this paper, we address the problem of a two-player linear quadratic differential game with incomplete information, a scenario commonly encountered in multi-agent control, human-robot interaction (HRI), and approximation methods to solve general-sum differential games. While solutions to such linear differential games are typically obtained through coupled Riccati equations, the complexity increases when agents have incomplete information, particularly when neither is aware of the other’s cost function. To tackle this challenge, we propose a model-based Peer-Aware Cost Estimation (PACE) framework for learning the cost parameters of the other agent. In PACE, each agent treats its peer as a learning agent rather than a stationary optimal agent, models their learning dynamics, and leverages this dynamic to infer the cost function parameters of the other agent. This approach enables agents to infer each other’s objective function in real time based solely on their previous state observations and dynamically adapt their control policies. Furthermore, we provide a theoretical guarantee for the convergence of parameter estimation and the stability of system states in PACE. Additionally, using numerical studies, we demonstrate how modeling the learning dynamics of the other agent benefits PACE, compared to approaches that approximate the other agent as having complete information, particularly in terms of stability and convergence speed.
more » « less
Free, publicly-accessible full text available May 30, 2026
Task as Context Prompting for Accurate Medical Symptom Coding Using Large Language Models

He, Chengyang; Zhang, Wenlong; Chen, Violet Xinying; Ning, Yue; Wang, Ping (June 2025, IEEEACM Conference on Connected Health Applications Systems and Engineering Technologies)

Free, publicly-accessible full text available June 24, 2026
Natural Language Querying on NoSQL Databases: Opportunities and Challenges

https://doi.org/10.1109/BigData62323.2024.10825998

Zhang, Wenlong; Shi, Tian; Wang, Ping (December 2024, IEEE)

Full Text Available
Tactile-Based Exploration, Mapping, and Navigation With Collision-Resilient Aerial Vehicles

https://doi.org/10.1109/TMECH.2025.3572888

Patnaik, Karishma; Saravanakumaran, Aravind_Adhith Pandian; Zhang, Wenlong (August 2025, IEEE/ASME Transactions on Mechatronics)

Free, publicly-accessible full text available August 1, 2026
Natural Language Querying on Domain-Specific NoSQL Database with Large Language Models

https://doi.org/10.1109/BIBM62325.2024.10822485

Zhang, Wenlong; He, Chengyang; Yang, Guanqun; Bandyopadhyay, Dipankar; Shi, Tian; Wang, Ping (December 2024, IEEE)

Full Text Available
Dynamic Collision-Inclusive Modeling of a Multirotor Aerial Vehicle using Linear Complementarity Systems

https://doi.org/10.23919/ACC63710.2025.11107858

Abazari, Amirali; Kumar, Yogesh; Patnaik, Karishma; Zhang, Wenlong (July 2025, IEEE)

Free, publicly-accessible full text available July 8, 2026
Pontryagin neural operator for solving general-sum differential games with parametric state constraints

Zhang, Lei; Ghimire, Mukesh; Xu, Zhe; Zhang, Wenlong; Ren, Yi (July 2024, 6th Annual Learning for Dynamics & Control Conference)

The values of two-player general-sum differential games are viscosity solutions to Hamilton-Jacobi-Isaacs (HJI) equations. Value and policy approximations for such games suffer from the curse of dimensionality (CoD). Alleviating CoD through physics-informed neural networks (PINN) encounters convergence issues when value discontinuity is present due to state constraints. On top of these challenges, it is often necessary to learn generalizable values and policies across a parametric space of games, eg, for game parameter inference when information is incomplete. To address these challenges, we propose in this paper a Pontryagin-mode neural operator that outperforms existing state-of-the-art (SOTA) on safety performance across games with parametric state constraints. Our key contribution is the introduction of a costate loss defined on the discrepancy between forward and backward costate rollouts, which are computationally cheap. We show that the discontinuity of costate dynamics (in the presence of state constraints) effectively enables the learning of discontinuous values, without requiring manually supervised data as suggested by the current SOTA. More importantly, we show that the close relationship between costates and policies makes the former critical in learning feedback control policies with generalizable safety performance.
more » « less
Full Text Available
Solving Two-Player General-Sum Game Between Swarms

https://doi.org/10.23919/ACC60939.2024.10644320

Ghimire, Mukesh; Zhang, Lei; Zhang, Wenlong; Ren, Yi; Xu, Zhe (July 2024, IEEE)

Full Text Available
Design, Contact Modeling, and Collision-Inclusive Planning of a Dual-Stiffness Aerial RoboT (DART)

https://doi.org/10.1109/ICRA55743.2025.11128035

Kumar, Yogesh; Patnaik, Karishma; Zhang, Wenlong (May 2025, IEEE)

Free, publicly-accessible full text available May 19, 2026
Nonlinear Wrench Observer of an Underactuated Aerial Manipulator

https://doi.org/10.2514/6.2024-0660

Garrard, YiZhuang; Stoumbos, Tom; Inoyama, Daisaku; Zhang, Wenlong (January 2024, AIAA SCITECH 2024 Forum)

Unmanned aerial manipulators have been growing in popularity over the years, alongside the complexity of the tasks they undertake. Many of these tasks include physical interaction with the environment, where a force control or sensing component is desirable. In these types of applications, the forces and torques, or the wrench, acting on the robot by the environment must be known. This paper presents a wrench observer based on an Extended Kalman filter (EKF), and compares it against acceleration-based, momentum-based, and hybrid wrench observers. Simulations using each of these observers are conducted with an underactuated aerial manipulator composed of a hexarotor with coplanar propellers and a 2-DOF manipulator. Measurement noise on par with what is expected in real-world applications is added to the sensor signals, and results show that the EKF-based wrench observer is superior at noise reduction and wrench estimation in many cases compared to the other observers.
more » « less
Full Text Available

« Prev Next »

Search for: All records